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Integrating Genomics, Proteomics, and Bioinfbrmatfcs 

John N. Weinstein, M.D., Ph.D. ! 

^SS, ^? een c " ,tod the "Post-Genomic" Era in biomedical sciences (I would 
! ^ I' new A demands are b ^9 Placed on bioirjformatics. For most omic 
£ I 6 ^? ? haS6S: A ^ a ' ysis and lnter Pretation [4}. The analysis phase is principally 

the province of statisticians and experts in machine learning; trie interpretation phase s 
pnncipally the province of biologists, preferably with encyclopedic knowledge of Se tools and 
database resources necessary to put whole lists of genes and gene products into perspective. 

Our own work along these lines has principally related to the 60 cell lines (the NCI-60) r5-71 
w»t J?n nnn atl ° na ' ^J 0 * lnstitute " s dnj 9 discovery program. Thefee cells have been treated 
with >70,000 compounds one at a time and independently over a 1&-year period Cell lines in 
culture do not fully reflect cells in vivo, of course, but, historically, most of our knowledae of 
molecular pharmacology and targets has come from cultured cells! not clinical material We 
and our colleagues have assessed expression patterns in the NCF-6G using 2-D protein oel 
electrophoresis [8], high density "reverse-phase" protein arrays, cDNIA microarrays [B 101 and 
oligonucleotide chips [11]. To find patterns in the data, we have Ihen developed new data 
visualizations, including the familiar Clustered Image Map [12], arid a tool (MedMlner) that 
streamlines literature searches on genes and drugs [13], We arfid our collaborators next 
characterized the cells at the DNA level by comparative genomic hybridization (CGH) spectral 
karyotyping, array-CGH, and SNP chip and then developed algorithms and a program package 
called LeadScope/LeadMiner [14]. This package makes it possible lb predict which molecular 
substructures will be found in drugs that are active against cells expressing large amounts of a 
selected gene - or vice versa. Finally, we have developed the rirogram package GEEVS 
(GEnome Exploration and Visualization System) to integrate all ofi these disparate types of 
data at the DNA, RNA, protein, functional, and pharmacological levels. Part of the GEEVS 
package, and also for independent use, are MatchMiner [15] and doMiner, two programs of 
major utility that facilitate biological interpretation of omic information). One clinically interesting 
pharmacogenomic [10] outcome of these various studies: [--asparaginase may prove useful for 
Ef^Ef ovari ?n. cancers that express only low levels of aspWagine synthetase. See 
http://discover.nci.nih.gov. ! 
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